YouTube videos on Dense Captioning
DenseCap: Fully Convolutional Localization Networks for Dense Captioning
ActivityNet Event Dense-Captioning
Molmo 2 | Dense Captioning
From Images to Videos: PLLaVA's Breakthrough in Video Dense Captioning
[CVPR 2024] Streaming Dense Video Captioning
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining
Dense Captioning of Images - Video Demo
Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning
Dense captioning with Azure Computer Vision 4.0 (Florence)
Dense Video Captioning with Semantic Features and Attention
Bi-directional Contextual Attention for 3D Dense Captioning
Improving Descriptive Deficiencies with a Random Selection Loop for 3D Dense Captioning
Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
Multimodal Pretraining for Dense Video Captioning
Generation of a descriptive paragraph of an image using dense captioning
iPerceive | Applying Common-Sense Reasoning to Dense Video Captioning and Video Question Answering
Create image captions that focus on what you need
ActivityNet Dense Event Captioning Results
Dense Motion Captioning: CompMo Dataset & DEMO